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Abstract — Maximum likelihood decoding of 
long block codes is not feasable due to large 
complexity. Some classes of codes are shown to 
be decomposable into multilevel concatenated 
codes(MLCC) [3]. For these codes, multistage 
decoding provides good trade-off between per- 
formance and complexity. In this paper, we de- 
rive an upper bound on the probability of block 
error for MLCC. We use this bound to evaluate 
difference in performance for different decom- 
positions of some codes. Examples given show 
that a significant reduction in complexity can 
be achieved when increasing number of stages of 
decoding. Resulting performance degradation 
varies for different decompositions. A guide- 
line is given for finding good 111 -level decompo- 
sitions. 

I. Introduction 

Multistage Decoding (MSD) is an efficient soft deci- 
sion decoding method for long decomposable codes, 
such as multilevel concatenated codes (MLCC) [1]. Al- 
though suboptimum in performance, it greatly reduces 
the computational complexity as compared to opti- 
mum decoding. 

Since its inception in Hemmatrs paper [2], Closest 
Coset Decoding (CCD) of |u|ti-f- v\ codes and general- 
izations of CCD have been investigated by several au- 
thors. However, only a few papers on the performance 
analysis of this method have appeared [3-6]. Further- 
more, these methods are restricted to |u[u + u| codes 
and some others to block modulation codes. The dif- 
ferent code-structure of MLCC’s precludes application 
of the analysis therein to the case of MLCCs. 

In [3], an upper bound on the effective error coeffi- 
cient (EEC) for 2-stage decoding of MLCC’s was de- 
rived and some guidelines for choosing a good 2-level 
decomposition of Reed Muller codes were given. 

In this paper, we first derive an upper bound on block 
error probability of MSD of MLCC’s, when optimum 
decoding of each stage is performed. We first express 
the upper bound in terms of all the error coefficients, 
and then explain how these coefficients can be ob- 
tained using some combinatorial methods and weight 
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distributions of inner and outer codes of a MLCC. The 
bound enables prediction of the performance of MSD 
(two or more levels) without simulation. Therefore an 
estimate of performance degradation, when the num- 
ber of decoding stages is increased, can be obtained. 

II. Analysis of CCD of Multilevel 
Concatenated Codes 

For an M-level code C = {B 1 , B 2 , . . . , B^/} * 
{Ai, A 2 , . . . , A,v/}, consider the i-th stage of decoding 
for 1 < i < M . At each stage, decoding is assumed to 
be complete (an estimate of a transmitted codeword is 
always given) soft decision maximum likelihood closest 
coset decoding [5]. 

Let denote the probability that the i-th stage de- 
coder makes an error when all previous (i— 1) stages are 
correct. Then, conditioning on the i-th stage decoding 
being in error, we obtain P e — ^ ■ * n o enera ^ 

P^ depends on the codeword at the output of the i-th 
inner encoder. However, for binary linear codes, BPSK 
transmission and the AWGN channel, P^ is the same 
for all transmitted codewords [7]. For simplicity, we 
assume that the all-zero codeword is transmitted. In 
this case, the output of each B,-encoder at the trans- 
mitter is the all-^ero codeword in B t . The received 
(n A nB)-tuple r = r^, . . . , r^ ne) ) is sectionalized 

into tiq sections each of length n Ai where ng and n A 
are lengths of outer and inner code respectively. 

E 11 error at the i-th stage, when CCD is used, occurs 
if a nonzero codeword b = (b\ > 6o, . . . , b nB ) 6 B t has 
larger correlation than the all-zero codeword z £ B t . 
Let a ; = (a ; lf ai, . . . , a J nA ) be the codeword with the 
best correlation in the coset corresponding to j - th sym- 
bol bj , for j = 1, 2, . . . , ns- If a standard BPSK map- 
ping (0 — *■ 1,1 1) is applied, then the difference 

in metrics of b and z can be bounded by the following 
expression: 

iW(b) — A/(z) < YL E M^'). (1) 

where M(-) denotes correlation metric. Thus the prob- 
ability of error at the i-th stage is given by 

P li) = P(M{ b) - M{z) > 0, b € Bj, b # 0). (2) 



For a particular a and b, 


T(a,b)= £ £ (~ 2 + 

J=l,bj?0 f=l a J = L 

is a Gaussian randome variable with mean /j and 
variance^ 2 given by 

js = 2A ^ u;(a), cr 2 = 4 u;(a) 

*>j*0 b j7 L0 

where A is the amplitude of a transmitted BPSK signal 
(SNR=10 log (A 2 /2 k/n)). It then follows that 


p{i) < E E E e(T(a,b)>0) 

bgBj *i*° aefl(bj) 


beBj *i^oa€n(4,-) 
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where fi(6y) represents the coset corresponding to the 
symbol bj t and w(-) the weight of a codeword. 

Note that for any 0 ^ b 6 B* 0 ^ a G A,-, 

u; ( a ) ^ d m i n (C) 

Definition 1: Define Si(Y) as the number of terms 
of the form ^ erfc ^Ay/Y/2^ satisfying 

i- y = 

2. b G Bj,a E Cl(bj) 

The union bound on the probability of error at the 
t-th stage of decoding is then given by 

< £>(y)^erfc (/Iv'm) (4) 

where the summation is taken over ail Y defined above. 
Thus 


< E E 5 '( y ) \ erfc (-4072) • (5) 

Si(Y) depends on the symbol weight distribution of 
the i-th outer code B,, the Hamming weight distribu- 
tion of the MLC code C and critically on the weight 
distribution (of n^-tuples) in the cosets of the parti- 
tion Ai/At+i. For inner codes of small lengths the 
coset weight distribution can be evaluated easily by an 
exhaustive computer search program or using combi- 
natorial methods. 

III. Results for some MLCC codes 
Consider the decomposition 

RM(64, 42, 16) = {(8, 1. 8)(8, 4, 4) 3 , (8, 7, 2) 3 (8, 8, 1)} * 

{(8,8, 1), (8, 4, 4), (8, 0)} 


given in [3]. The symbol weight distribution of B\ is 

A'b,(0) = 1, N Bl (4) = 98,A r Bl (6) = 1, 176, N Bl (7) = 
1, 344, N Bx {8) = 5,573. The symbol weight dis- 
tribution of Bn is Nb?(0) = 1, Nb 7 ( 1) = 8, 

/Vb 3 ( 2) = 812, N Bl { 3) = 23,576, N Bj { 4) = 443,030, 
Nb 7 (5) = 5,315,576, A'b 3 (6) = 39,867,212,7^(7) = 
170,859,3 68, N B *(8) = 320,361,329. The coset 
weight distributions of A^/A 2 and A 2 /{0} are given 
in Table 1. 

To compute S»(Y) with* = 1,2 , we have to deter- 
mine how many n^n^-tuples of weight Y can result 
from a b G B^ of symbol weight ^(B^) = AY In 
order to answer this question, we must see if a parti- 
tion of the integer Y into X parts is possible with each 
part belonging to a valid entry in the coset weight dis- 
tribution of the partition (8,8, l)/(8,4,4). A partition 
of the form 8 = 2 + 3+14-2 is invalid because none 
of the non-zero cosets in the above partition have a 
vector of Hamming weight 1 or 3. Furthermore, the 
partitions must be counted in an ordered fashion (i.e., 
we are counting the so called distributions of an in- 
teger). For example, there are 12 distributions of 16 
into parts {4, 4, 2, 6}. The partition table for first two 
nonzero weights in both first and second stage are given 
in Table 2. 

The effective error coefficient in stage- 1 is determined 
as Si(8) = 98 x 256 = 25,088. The next two error 
coefficients are 5^ (10) = 98 x4x 4 3 x8 = 200704 and 
Si ( 12) = 9S x (6 x 8 2 x 4 2 + 4 4 ) + 4 6 x 1 176 = 5444096. 
The higher order error coefficients can be computed in 
a similar manner. 

In the second stage of decoding, the effective error 
coefficient is St* (8) = 8 + 784 = 792 and the er- 
ror coefficient So (12) = 18,816. The first three er- 
ror coefficients for this 2- level decomposition of RM 

(64.42.8) corresponding to weights 8, 10 and 12 are 
Si(8) + Sn(8) = 25,880 , Si (10) = 200,704 and 
5i(12) + S 3 (12) = 5,462,912. 

In summary, for the Reed-Muller code C = 

(64.42.8) , an upper bound (with first two significant 
terms) on the block error probability with two-stage 
decoding is 


-Fc,2 — stage ^ 2.5 Iff*! erfc + 

2.0 10 5 1 erfc (4^10/2) + 5. 462, 912^ erfc {a^/\2/2^ . 

The cop>plexitv of stage-2 can be reduced by employ- 
ing a 3-stage decomposition 


RM(64, 42, 8) = 

{(8, 1, 8)(8, 4, 4) 3 , (8, 7. 2)\ (8, 7, 2)(8. 8,1)}* 

{(8,8,1), (8, 4,4), (8, 2, 4), (8,0)} (6) 


for which 



Pe, 3-stage < 2 ' S ^ \ erfc n/S/^) + 

2.0 10 5 ^ erfc (.4%/l0/2) 4- 5.4 10 6 - erfc ^.4>/T2/2^ . 

Notice that the asymptotic loss in decoding gain, due 
to this simplification, predicted from the bound is prac- 
tically zero, and this agrees with actual simulation re- 
sults shown in Figure 1. 

Table 1 


Coset- #’s 

Weight of vector 


0 

2 

4 

6 

8 

T “" 0 H 

1 

- 

14 

- 

1 

1-7 

- 

4 

8 

4 



Table 2 


w,(b) 

Partition 

Si 

iVo 

4 

{2, 2, 4, 4} 

6 

4^ 8- 6 

4 

{2, 2, 2, 6} 


4*4 

4 

12,2,2,4} 

4 

4 a 8 

5 ~1 

Not possible 


- 

7 

Not possible 

- 

- 

6 

12,2,2,2,2,2} 

1 

4 b 


jVj =Number of partitions. iVo = Number of code- 
words in Bj * [Aj/Aj+i] per partition. 

For long codes, even 2-stage CCD cannot be per- 
formed in an optimal manner due to the complexity 
of the trellises involved. Hence it is imperative that 
tight bounds be derived so that the loss of coding gain 
at high SNRs due to m-stage (m > 3) can be predicted. 

The loss in coding gain of 2-stage relative to m- 
stage decoding can be determined based on the er- 
ror coe'ficients derived in the paper. For the code 
C = (128,64,16) RM. and n = 2,3, this is shown 

in Figure 2. 

The corresponding expressions are 

Pe, 2 — stage < 13.8 lO 6 ^ erfc (a/RV 2 ) + 

5.1 10 9 ^ erfc (a%/ 24/2) 

Pe, 2 -,t ag e < 1.6 x 10 9 ^erfc ( a >/ 16 / 2 ) + 

6.1 x 10 13 ^ erfc (.4\/24/2) 

The three stage decomposition has a trellis complex- 
ity of 512, 128, and 512 states for the first, second, 
and third stage, and the average number of operations 
per information bit is 1263. The most of the computa- 
tional complexity is due to a large and fully conected 
trellis of the third stage, so a wav to reduce it is to im- 
plement 4-stage decoding by dividing the third stage 
into two stages. 


The difference between the bounds for 2-stage and 3- 
stage at block error rate 10” 6 is about 0.6 dB. Since 
the simulation of 2-stage decoding with MLD at each 
stage is not available, we compared the results of 3- 
stage decoding with a 2-stage decoding where the first 
stage is decoded using iterative search. We believe this 
would be very close in performance of optimum 2-stage 
decoding. The difference in performance for this 2- 
stage and optimum 3-stage decoding is 0.6 dB as well. 
The third example is done for the (72,52,8 decom- 
posable code. Bounds for 3 and 4-stage decoding of 
this code are given in Figure 3. 

The state complexities are 2, 512,8, 1 for the 4-stage 
decoding, and (1024,8, 1} for the 3-stage decoding. 

IV. Trade-offs between complexity and 

PERFORMANCE 

We have shown in the previous section that the perfor- 
mance of 2 and 3-stage decoding of (64, 42, 8) code are 
almost identical. However, this is true only for 3-stage 
decomposition (6). Reduction in state complexity is 
not significant, since maximum number of states in the 
first stage remains 128 (the same as for 2-stage decom- 
position) and the maximum numbers of states at the 
second and third stage are 4 and 2 respectively (as com- 
pared to 8 states in 2-stage decomposition). The aver- 
age number of operations per information bit, though, 
is reduced from 393 to 133 (almost three times). 

For another 3-stage decomposition, 

RM(64, 42, 16) = 

{(8, 1,8)(8,4,4),(8,4,4) 2 ,(8,7,2) 3 (8,8, 1)} * 

{(8,8, 1), (8, 6, 4), (8, 4, 4), (8, 0)} , (7) 

maximum number of states is reduced to 16 (in the 
second stage), so 8 times compared to two stage de- 
compoition. Unfortunately, this decomposiyion loses 
about 0.5 dB in performance, and the average num- 
ber of operations per information bit is 22/ . Thus the 
3-stage decomposition (6) outperforms decomposition 
(7), while having smaller computational complexity. 

We can see that the performance of the multistage 
scheme depends mainly on the first decoding stage. 
This suggests that when increasing the number of 
stages, lower level stages should be divided, while the 
first should be kept as long as the number of states 
in the outer code trellis is not too big. In such man- 
ner, the performance degradation will not be large (in 
some cases almost negligible). Since lower level codes 
in multilevel decomposition are high rate codes with 
complex trellis structure, splitting each of them into 
two levels will reduce the complexity significantly (as 
we have seen in the example of RM(64,42,8) code). 
This can be used to determine a good decomposition 
of a code. 



V. Conclusion 

In this paper, we have shown how to bound the per- 
formance of multistage decoding of Multilevel concate- 
nated codes. We presented examples which show that 
the simulation results converge to derived bounds at 
moderate to high signal-to-noise ratios. Therefore, an 
estimate of the difference in performance for differ- 
ent decompositions and different numbers of decod- 
ing stages can be determined. Analysis of reduction 
in complexity when using large number of decoding 
stages has also been discussed. 



Figure I: Simulation results and performance bounds 
for RM(64,42,8) code 



Figure 2: Simulation results and performance bounds 
for RM(128,64,16) code 
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Figure 3: Performance bounds for (72,52,8) decompos- 
able code 






